Overview

Dataset statistics

Number of variables: 18
Number of observations: 140
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 28.8 KiB
Average record size in memory: 210.5 B

Variable types

NUM: 17
CAT: 1

Reproduction

Analysis started: 2020-02-17 20:52:50.978069
Analysis finished: 2020-02-17 20:53:40.456245
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

Neighbourhood has a high cardinality: 140 distinct values [High cardinality]
11|Youth (15-24 years) is highly correlated with 3|Population, 2016 [High correlation]
3|Population, 2016 is highly correlated with 11|Youth (15-24 years) and 4 other fields [High correlation]
12|Working Age (25-54 years) is highly correlated with 3|Population, 2016 and 4 other fields [High correlation]
13|Pre-retirement (55-64 years) is highly correlated with 3|Population, 2016 and 1 other fields [High correlation]
14|Seniors (65+ years) is highly correlated with 13|Pre-retirement (55-64 years) [High correlation]
1055|Total - Household after-tax income groups in 2015 for private households - 100% data is highly correlated with 3|Population, 2016 and 5 other fields [High correlation]
1973|Total - Commuting duration for the employed labour force aged 15 years and over in private households with a usual place of work or no fixed workplace address - 25% sample data is highly correlated with 3|Population, 2016 and 4 other fields [High correlation]
1974| Less than 15 minutes is highly correlated with 1055|Total - Household after-tax income groups in 2015 for private households - 100% data and 1 other fields [High correlation]
1975| 15 to 29 minutes is highly correlated with 12|Working Age (25-54 years) and 3 other fields [High correlation]
1976| 30 to 44 minutes is highly correlated with 12|Working Age (25-54 years) and 2 other fields [High correlation]

Variables

Neighbourhood
Categorical

HIGH CARDINALITY
UNIFORM
UNIQUE
Distinct count140
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
ValueCountFrequency (%) 
Woodbine Corridor 1 0.7%
 
Roncesvalles 1 0.7%
 
Bedford Park-Nortown 1 0.7%
 
Regent Park 1 0.7%
 
Weston-Pelham Park 1 0.7%
 
Woburn 1 0.7%
 
Markland Wood 1 0.7%
 
Willowridge-Martingrove-Richview 1 0.7%
 
Thorncliffe Park 1 0.7%
 
Princess-Rosethorn 1 0.7%
 
Other values (130) 130 92.9%
 

Length

Max length35
Mean length16.61428571
Min length5
ValueCountFrequency (%) 
Uppercase_Letter 24 44.4%
 
Lowercase_Letter 23 42.6%
 
Other_Punctuation 3 5.6%
 
Close_Punctuation 1 1.9%
 
Space_Separator 1 1.9%
 
Dash_Punctuation 1 1.9%
 
Open_Punctuation 1 1.9%
 
ValueCountFrequency (%) 
Latin 47 87.0%
 
Common 7 13.0%
 
ValueCountFrequency (%) 
ASCII 54 100.0%
 

1|Neighbourhood Number
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count140
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean70.5
Minimum1
Maximum140
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum1
5-th percentile7.95
Q135.75
median70.5
Q3105.25
95-th percentile133.05
Maximum140
Range139
Interquartile range (IQR)69.5

Descriptive statistics

Standard deviation40.55859958
Coefficient of variation (CV)0.5752992848
Kurtosis-1.2
Mean70.5
Median Absolute Deviation (MAD)35
Skewness0
Sum9870
Variance1645
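
The figures above can be cross-checked by hand. The following is a minimal sketch, assuming the column holds exactly the integers 1 to 140 (consistent with the distinct count, minimum, maximum and sample rows, but not stated outright in the report). The `quantile` helper is a hypothetical name that uses linear interpolation between order statistics, and the variance uses the n - 1 denominator. The deviation statistic is computed as a mean absolute deviation; for this symmetric sequence the median absolute deviation is also 35, so the example cannot tell the two definitions apart.

```python
import math
import statistics

# Assumption: the column is the sequence 1, 2, ..., 140.
values = list(range(1, 141))


def quantile(sorted_vals, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_vals) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
p5 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1
mean_abs_dev = sum(abs(v - mean) for v in values) / len(values)

print(mean, variance, round(std, 8), round(cv, 10))
print(round(p5, 2), q1, q3, round(p95, 2), iqr, mean_abs_dev)
```

Under that assumption the printed values agree with the quantile and descriptive statistics listed for this variable.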
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 140.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
140 1 0.7%
 
44 1 0.7%
 
50 1 0.7%
 
49 1 0.7%
 
48 1 0.7%
 
47 1 0.7%
 
46 1 0.7%
 
45 1 0.7%
 
43 1 0.7%
 
52 1 0.7%
 
Other values (130) 130 92.9%
 
ValueCountFrequency (%) 
1 1 0.7%
 
2 1 0.7%
 
3 1 0.7%
 
4 1 0.7%
 
5 1 0.7%
 
ValueCountFrequency (%) 
140 1 0.7%
 
139 1 0.7%
 
138 1 0.7%
 
137 1 0.7%
 
136 1 0.7%
 

3|Population, 2016
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count140
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19511.22143
Minimum6577
Maximum65913
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum6577
5-th percentile9264.35
Q112019.5
median16749.5
Q323854.5
95-th percentile36983.45
Maximum65913
Range59336
Interquartile range (IQR)11835

Descriptive statistics

Standard deviation10033.58922
Coefficient of variation (CV)0.5142471095
Kurtosis3.808172145
Mean19511.22143
Median Absolute Deviation (MAD)7593.983163
Skewness1.667986755
Sum2731571
Variance100672912.7
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 6577. 9249.5 17968.5 34928.5 65913. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
26572 1 0.7%
 
21567 1 0.7%
 
14417 1 0.7%
 
15179 1 0.7%
 
10732 1 0.7%
 
11848 1 0.7%
 
15683 1 0.7%
 
26984 1 0.7%
 
30526 1 0.7%
 
46496 1 0.7%
 
Other values (130) 130 92.9%
 
ValueCountFrequency (%) 
6577 1 0.7%
 
7607 1 0.7%
 
7727 1 0.7%
 
7804 1 0.7%
 
7865 1 0.7%
 
ValueCountFrequency (%) 
65913 1 0.7%
 
53485 1 0.7%
 
50434 1 0.7%
 
46496 1 0.7%
 
43993 1 0.7%
 

9|Land area in square kilometres
Real number (ℝ≥0)

Distinct count127
Unique (%)90.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.501714286
Minimum0.42
Maximum36.89
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum0.42
5-th percentile1.17
Q11.8525
median3.275
Q35.3825
95-th percentile9.9935
Maximum36.89
Range36.47
Interquartile range (IQR)3.53

Descriptive statistics

Standard deviation4.544665138
Coefficient of variation (CV)1.009540999
Kurtosis24.41673535
Mean4.501714286
Median Absolute Deviation (MAD)2.694963265
Skewness4.196053368
Sum630.24
Variance20.65398121
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0.42 1.355 1.935 5.525 10.115 36.89 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1.59 2 1.4%
 
3.46 2 1.4%
 
1.83 2 1.4%
 
5.2 2 1.4%
 
1.52 2 1.4%
 
1.68 2 1.4%
 
7.83 2 1.4%
 
4.7 2 1.4%
 
1.17 2 1.4%
 
3.1 2 1.4%
 
Other values (117) 120 85.7%
 
ValueCountFrequency (%) 
0.42 1 0.7%
 
0.64 1 0.7%
 
0.9 1 0.7%
 
0.95 1 0.7%
 
1.01 1 0.7%
 
ValueCountFrequency (%) 
36.89 1 0.7%
 
29.81 1 0.7%
 
16.21 1 0.7%
 
15 1 0.7%
 
13.23 1 0.7%
 

10|Children (0-14 years)
Real number (ℝ≥0)

Distinct count129
Unique (%)92.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2843.964286
Minimum565
Maximum9625
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum565
5-th percentile1257.5
Q11695
median2405
Q33567.5
95-th percentile5821
Maximum9625
Range9060
Interquartile range (IQR)1872.5

Descriptive statistics

Standard deviation1546.225445
Coefficient of variation (CV)0.5436866606
Kurtosis3.090127023
Mean2843.964286
Median Absolute Deviation (MAD)1177.286735
Skewness1.5336706
Sum398155
Variance2390813.128
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 565. 1135. 2440. 4617.5 9625. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1695 3 2.1%
 
2050 2 1.4%
 
1470 2 1.4%
 
1610 2 1.4%
 
3505 2 1.4%
 
2115 2 1.4%
 
1960 2 1.4%
 
2325 2 1.4%
 
1675 2 1.4%
 
1745 2 1.4%
 
Other values (119) 119 85.0%
 
ValueCountFrequency (%) 
565 1 0.7%
 
800 1 0.7%
 
1120 1 0.7%
 
1150 1 0.7%
 
1165 1 0.7%
 
ValueCountFrequency (%) 
9625 1 0.7%
 
7960 1 0.7%
 
7910 1 0.7%
 
7090 1 0.7%
 
6120 1 0.7%
 

11|Youth (15-24 years)
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count130
Unique (%)92.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2430.928571
Minimum675
Maximum7840
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum675
5-th percentile920
Q11428.75
median2100
Q33022.5
95-th percentile5459.25
Maximum7840
Range7165
Interquartile range (IQR)1593.75

Descriptive statistics

Standard deviation1457.994778
Coefficient of variation (CV)0.5997686624
Kurtosis2.98582713
Mean2430.928571
Median Absolute Deviation (MAD)1056.265306
Skewness1.697447082
Sum340330
Variance2125748.772
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 675. 2530. 3427.5 7840. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2175 2 1.4%
 
1305 2 1.4%
 
1355 2 1.4%
 
1465 2 1.4%
 
920 2 1.4%
 
1065 2 1.4%
 
2275 2 1.4%
 
2225 2 1.4%
 
2185 2 1.4%
 
1035 2 1.4%
 
Other values (120) 120 85.7%
 
ValueCountFrequency (%) 
675 1 0.7%
 
735 1 0.7%
 
855 1 0.7%
 
885 1 0.7%
 
905 1 0.7%
 
ValueCountFrequency (%) 
7840 1 0.7%
 
7660 1 0.7%
 
6940 1 0.7%
 
6860 1 0.7%
 
6700 1 0.7%
 

12|Working Age (25-54 years)
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count136
Unique (%)97.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8783.678571
Minimum2750
Maximum45105
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum2750
5-th percentile3604.75
Q15465
median7475
Q310588.75
95-th percentile18523.5
Maximum45105
Range42355
Interquartile range (IQR)5123.75

Descriptive statistics

Standard deviation5423.203831
Coefficient of variation (CV)0.6174182931
Kurtosis14.33383694
Mean8783.678571
Median Absolute Deviation (MAD)3669.62449
Skewness2.898162335
Sum1229715
Variance29411139.79
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2750. 12485. 22632.5 45105. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6050 2 1.4%
 
5860 2 1.4%
 
3790 2 1.4%
 
7470 2 1.4%
 
6655 1 0.7%
 
11860 1 0.7%
 
7005 1 0.7%
 
18780 1 0.7%
 
3615 1 0.7%
 
19790 1 0.7%
 
Other values (126) 126 90.0%
 
ValueCountFrequency (%) 
2750 1 0.7%
 
3090 1 0.7%
 
3245 1 0.7%
 
3310 1 0.7%
 
3370 1 0.7%
 
ValueCountFrequency (%) 
45105 1 0.7%
 
25850 1 0.7%
 
23320 1 0.7%
 
21945 1 0.7%
 
20640 1 0.7%
 

13|Pre-retirement (55-64 years)
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count124
Unique (%)88.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2404.464286
Minimum650
Maximum6690
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum650
5-th percentile1148.25
Q11591.25
median2025
Q33078.75
95-th percentile4623
Maximum6690
Range6040
Interquartile range (IQR)1487.5

Descriptive statistics

Standard deviation1161.127227
Coefficient of variation (CV)0.482904751
Kurtosis1.833031126
Mean2404.464286
Median Absolute Deviation (MAD)904.5790816
Skewness1.321336415
Sum336625
Variance1348216.438
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 650. 1172.5 2025. 3632.5 6690. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1760 3 2.1%
 
2020 3 2.1%
 
1750 2 1.4%
 
2430 2 1.4%
 
1435 2 1.4%
 
1725 2 1.4%
 
1825 2 1.4%
 
1625 2 1.4%
 
3030 2 1.4%
 
1325 2 1.4%
 
Other values (114) 118 84.3%
 
ValueCountFrequency (%) 
650 1 0.7%
 
885 1 0.7%
 
940 1 0.7%
 
970 1 0.7%
 
1050 1 0.7%
 
ValueCountFrequency (%) 
6690 1 0.7%
 
6245 1 0.7%
 
5930 1 0.7%
 
5535 1 0.7%
 
5460 1 0.7%
 

14|Seniors (65+ years)
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count129
Unique (%)92.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3048.285714
Minimum730
Maximum8990
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum730
5-th percentile1273.5
Q11876.25
median2652.5
Q33768.75
95-th percentile6050.75
Maximum8990
Range8260
Interquartile range (IQR)1892.5

Descriptive statistics

Standard deviation1579.021618
Coefficient of variation (CV)0.5180031552
Kurtosis1.299532337
Mean3048.285714
Median Absolute Deviation (MAD)1244.942857
Skewness1.167086981
Sum426760
Variance2493309.27
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 730. 1232.5 3790. 6215. 8990. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5250 2 1.4%
 
2705 2 1.4%
 
4165 2 1.4%
 
1785 2 1.4%
 
2015 2 1.4%
 
2550 2 1.4%
 
2695 2 1.4%
 
2160 2 1.4%
 
1865 2 1.4%
 
1325 2 1.4%
 
Other values (119) 120 85.7%
 
ValueCountFrequency (%) 
730 1 0.7%
 
895 1 0.7%
 
965 1 0.7%
 
1025 1 0.7%
 
1095 1 0.7%
 
ValueCountFrequency (%) 
8990 1 0.7%
 
8010 1 0.7%
 
7405 1 0.7%
 
6975 1 0.7%
 
6625 1 0.7%
 

15|Older Seniors (85+ years)
Real number (ℝ≥0)

Distinct count93
Unique (%)66.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean471.0357143
Minimum50
Maximum1640
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum50
5-th percentile140
Q1258.75
median367.5
Q3656.25
95-th percentile957.25
Maximum1640
Range1590
Interquartile range (IQR)397.5

Descriptive statistics

Standard deviation297.9634088
Coefficient of variation (CV)0.6325707366
Kurtosis1.726521946
Mean471.0357143
Median Absolute Deviation (MAD)238.3857143
Skewness1.243586523
Sum65945
Variance88782.19296
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 50. 137.5 405. 977.5 1640. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
140 5 3.6%
 
255 3 2.1%
 
195 3 2.1%
 
265 3 2.1%
 
300 3 2.1%
 
325 3 2.1%
 
330 3 2.1%
 
365 3 2.1%
 
400 3 2.1%
 
260 3 2.1%
 
Other values (83) 108 77.1%
 
ValueCountFrequency (%) 
50 1 0.7%
 
95 1 0.7%
 
115 1 0.7%
 
125 1 0.7%
 
135 1 0.7%
 
ValueCountFrequency (%) 
1640 1 0.7%
 
1480 1 0.7%
 
1345 1 0.7%
 
1220 1 0.7%
 
1130 1 0.7%
 
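
1055|Total - Household after-tax income groups in 2015 for private households - 100% data
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count138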
Unique (%)98.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7949.392857
Minimum2650
Maximum40765
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum2650
5-th percentile3447.25
Q15142.5
median6567.5
Q39532.5
95-th percentile17513.5
Maximum40765
Range38115
Interquartile range (IQR)4390

Descriptive statistics

Standard deviation4795.355722
Coefficient of variation (CV)0.6032354682
Kurtosis15.53209871
Mean7949.392857
Median Absolute Deviation (MAD)3221.219388
Skewness3.014411638
Sum1112915
Variance22995436.5
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2650. 10335. 19510. 40765.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5440 2 1.4%
 
9190 2 1.4%
 
2650 1 0.7%
 
4950 1 0.7%
 
3925 1 0.7%
 
9930 1 0.7%
 
10065 1 0.7%
 
6480 1 0.7%
 
5455 1 0.7%
 
15320 1 0.7%
 
Other values (128) 128 91.4%
 
ValueCountFrequency (%) 
2650 1 0.7%
 
3120 1 0.7%
 
3125 1 0.7%
 
3215 1 0.7%
 
3240 1 0.7%
 
ValueCountFrequency (%) 
40765 1 0.7%
 
22305 1 0.7%
 
19690 1 0.7%
 
19330 1 0.7%
 
18780 1 0.7%
 

1071| $100,000 and over
Real number (ℝ≥0)

Distinct count122
Unique (%)87.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1881.571429
Minimum300
Maximum10415
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum300
5-th percentile550
Q1998.75
median1630
Q32250
95-th percentile4233
Maximum10415
Range10115
Interquartile range (IQR)1251.25

Descriptive statistics

Standard deviation1264.534647
Coefficient of variation (CV)0.6720630572
Kurtosis14.15939248
Mean1881.571429
Median Absolute Deviation (MAD)865.144898
Skewness2.748219876
Sum263420
Variance1599047.873
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 300. 2337.5 4792.5 10415. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1470 3 2.1%
 
1585 3 2.1%
 
905 3 2.1%
 
1745 3 2.1%
 
625 2 1.4%
 
2120 2 1.4%
 
980 2 1.4%
 
1000 2 1.4%
 
745 2 1.4%
 
550 2 1.4%
 
Other values (112) 116 82.9%
 
ValueCountFrequency (%) 
300 1 0.7%
 
410 1 0.7%
 
510 1 0.7%
 
515 2 1.4%
 
540 1 0.7%
 
ValueCountFrequency (%) 
10415 1 0.7%
 
4830 1 0.7%
 
4755 1 0.7%
 
4705 1 0.7%
 
4625 1 0.7%
 
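
1973|Total - Commuting duration for the employed labour force aged 15 years and over in private households with a usual place of work or no fixed workplace address - 25% sample data
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count134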
Unique (%)95.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8936.071429
Minimum2655
Maximum43785
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum2655
5-th percentile3848.75
Q15836.25
median7437.5
Q310652.5
95-th percentile19096.75
Maximum43785
Range41130
Interquartile range (IQR)4816.25

Descriptive statistics

Standard deviation5226.734051
Coefficient of variation (CV)0.5849028953
Kurtosis13.80403027
Mean8936.071429
Median Absolute Deviation (MAD)3568.27551
Skewness2.827881492
Sum1251050
Variance27318748.84
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2655. 12692.5 21927.5 43785. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
7580 2 1.4%
 
9365 2 1.4%
 
4805 2 1.4%
 
8075 2 1.4%
 
7260 2 1.4%
 
6795 2 1.4%
 
4095 1 0.7%
 
19090 1 0.7%
 
11355 1 0.7%
 
5720 1 0.7%
 
Other values (124) 124 88.6%
 
ValueCountFrequency (%) 
2655 1 0.7%
 
3225 1 0.7%
 
3375 1 0.7%
 
3465 1 0.7%
 
3500 1 0.7%
 
ValueCountFrequency (%) 
43785 1 0.7%
 
22060 1 0.7%
 
21795 1 0.7%
 
21595 1 0.7%
 
21500 1 0.7%
 

1974| Less than 15 minutes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count108
Unique (%)77.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1091.285714
Minimum255
Maximum9230
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum255
5-th percentile369.75
Q1653.75
median847.5
Q31301.25
95-th percentile2360.75
Maximum9230
Range8975
Interquartile range (IQR)647.5

Descriptive statistics

Standard deviation922.6489413
Coefficient of variation (CV)0.8454696412
Kurtosis43.53039805
Mean1091.285714
Median Absolute Deviation (MAD)528.6183673
Skewness5.40034264
Sum152780
Variance851281.0689
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 255. 1082.5 1767.5 2957.5 9230. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
530 4 2.9%
 
1210 4 2.9%
 
770 3 2.1%
 
415 3 2.1%
 
655 3 2.1%
 
765 2 1.4%
 
345 2 1.4%
 
900 2 1.4%
 
850 2 1.4%
 
1550 2 1.4%
 
Other values (98) 113 80.7%
 
ValueCountFrequency (%) 
255 1 0.7%
 
285 1 0.7%
 
340 1 0.7%
 
345 2 1.4%
 
360 1 0.7%
 
ValueCountFrequency (%) 
9230 1 0.7%
 
3740 1 0.7%
 
3175 1 0.7%
 
2740 1 0.7%
 
2695 1 0.7%
 

1975| 15 to 29 minutes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count125
Unique (%)89.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2532.571429
Minimum545
Maximum18250
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum545
5-th percentile964.25
Q11507.5
median2012.5
Q33028.75
95-th percentile5336.25
Maximum18250
Range17705
Interquartile range (IQR)1521.25

Descriptive statistics

Standard deviation1874.130636
Coefficient of variation (CV)0.7400109688
Kurtosis35.29370353
Mean2532.571429
Median Absolute Deviation (MAD)1128.734694
Skewness4.725333153
Sum354560
Variance3512365.642
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 545. 1277.5 2042.5 3292.5 6097.5 18250. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1500 2 1.4%
 
1520 2 1.4%
 
1560 2 1.4%
 
1750 2 1.4%
 
3165 2 1.4%
 
2020 2 1.4%
 
1350 2 1.4%
 
2340 2 1.4%
 
4135 2 1.4%
 
2265 2 1.4%
 
Other values (115) 120 85.7%
 
ValueCountFrequency (%) 
545 1 0.7%
 
770 1 0.7%
 
860 1 0.7%
 
865 1 0.7%
 
900 1 0.7%
 
ValueCountFrequency (%) 
18250 1 0.7%
 
7395 1 0.7%
 
7310 1 0.7%
 
6260 1 0.7%
 
5935 1 0.7%
 

1976| 30 to 44 minutes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count120
Unique (%)85.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2566.142857
Minimum785
Maximum9955
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum785
5-th percentile1167.25
Q11630
median2220
Q32996.25
95-th percentile5297
Maximum9955
Range9170
Interquartile range (IQR)1366.25

Descriptive statistics

Standard deviation1405.353247
Coefficient of variation (CV)0.5476519919
Kurtosis6.340905787
Mean2566.142857
Median Absolute Deviation (MAD)976.3326531
Skewness2.104785853
Sum359260
Variance1975017.749
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 785. 3130. 4610. 9955.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3000 3 2.1%
 
1215 3 2.1%
 
2200 2 1.4%
 
2195 2 1.4%
 
1205 2 1.4%
 
2215 2 1.4%
 
1865 2 1.4%
 
2610 2 1.4%
 
1840 2 1.4%
 
2255 2 1.4%
 
Other values (110) 118 84.3%
 
ValueCountFrequency (%) 
785 1 0.7%
 
955 1 0.7%
 
965 1 0.7%
 
1000 1 0.7%
 
1075 1 0.7%
 
ValueCountFrequency (%) 
9955 1 0.7%
 
7290 1 0.7%
 
7030 1 0.7%
 
6665 1 0.7%
 
6380 1 0.7%
 

1977| 45 to 59 minutes
Real number (ℝ≥0)

Distinct count115
Unique (%)82.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1297.464286
Minimum320
Maximum4265
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum320
5-th percentile494.5
Q1785
median1107.5
Q31615
95-th percentile2667
Maximum4265
Range3945
Interquartile range (IQR)830

Descriptive statistics

Standard deviation730.9374823
Coefficient of variation (CV)0.5633584603
Kurtosis3.147362776
Mean1297.464286
Median Absolute Deviation (MAD)545.7408163
Skewness1.577767996
Sum181645
Variance534269.603
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 320. 1255. 2342.5 4265. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
915 4 2.9%
 
1410 3 2.1%
 
1240 3 2.1%
 
630 3 2.1%
 
1170 3 2.1%
 
1615 3 2.1%
 
1100 3 2.1%
 
1000 3 2.1%
 
875 2 1.4%
 
675 2 1.4%
 
Other values (105) 111 79.3%
 
ValueCountFrequency (%) 
320 1 0.7%
 
340 1 0.7%
 
435 1 0.7%
 
440 1 0.7%
 
460 1 0.7%
 
ValueCountFrequency (%) 
4265 1 0.7%
 
4175 1 0.7%
 
3335 1 0.7%
 
3320 1 0.7%
 
3185 1 0.7%
 

1978| 60 minutes and over
Real number (ℝ≥0)

Distinct count122
Unique (%)87.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1448.571429
Minimum190
Maximum5530
Zeros0
Zeros (%)0.0%
Memory size1.2 KiB

Quantile statistics

Minimum190
5-th percentile368.25
Q1775
median1182.5
Q31752.5
95-th percentile3137.25
Maximum5530
Range5340
Interquartile range (IQR)977.5

Descriptive statistics

Standard deviation998.310855
Coefficient of variation (CV)0.6891692293
Kurtosis3.782290718
Mean1448.571429
Median Absolute Deviation (MAD)735.6428571
Skewness1.695676593
Sum202800
Variance996624.5632
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 190. 1822.5 3430. 5530. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
775 2 1.4%
 
1290 2 1.4%
 
1675 2 1.4%
 
855 2 1.4%
 
1565 2 1.4%
 
615 2 1.4%
 
1680 2 1.4%
 
895 2 1.4%
 
505 2 1.4%
 
740 2 1.4%
 
Other values (112) 120 85.7%
 
ValueCountFrequency (%) 
190 1 0.7%
 
220 1 0.7%
 
260 1 0.7%
 
295 1 0.7%
 
310 1 0.7%
 
ValueCountFrequency (%) 
5530 1 0.7%
 
5320 1 0.7%
 
5300 1 0.7%
 
4055 1 0.7%
 
3500 1 0.7%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that, for a linear relationship, the steepness of the line (its angle to the x-axis) does not affect the magnitude of r; only the direction of the slope determines its sign.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
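
The calculation described above can be sketched in plain Python. This is an illustrative helper (`pearson_r` is a hypothetical name, not part of the report tooling). The two lists are the `3|Population, 2016` and `12|Working Age (25-54 years)` values from the first ten rows of the Sample section, so the result only describes those ten rows, not the full dataset.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    return cov / (std_x * std_y)


# Values copied from the first ten sample rows of the report.
population = [33312, 32954, 10360, 10529, 9456, 22000, 22156, 10948, 15535, 11051]
working_age = [13845, 13615, 4160, 4300, 3700, 8635, 8140, 3790, 5940, 3825]

r = pearson_r(population, working_age)

# Shifting and (positively) rescaling either variable leaves r unchanged.
r_rescaled = pearson_r([p / 1000 for p in population],
                       [3 * w + 7 for w in working_age])

print(r, r_rescaled)
```

The two printed values coincide up to floating-point error, which illustrates the invariance under changes in location and scale.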

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
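
As a rough sketch of that recipe, ρ can be computed by ranking both variables and then applying the Pearson formula to the ranks. The helper names (`average_ranks`, `pearson_r`, `spearman_rho`) and the toy data are illustrative assumptions; tied values receive the average of the ranks they span, which is one common convention.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Toy data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

print(pearson_r(x, y), spearman_rho(x, y))
```

On this toy data Pearson's r falls below 1 because the relationship is curved, while ρ reaches 1 (up to rounding) because the ordering is perfectly preserved.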

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
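
The pair-counting definition above translates almost directly into code. The sketch below implements that plain form (often called tau-a), in which tied pairs count as neither concordant nor discordant; statistical libraries frequently default to a tie-adjusted variant, so results on data with ties may differ. The function name `kendall_tau` and the toy data are illustrative assumptions.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """(concordant - discordant) / total number of pairs, with ties counted as neither."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    n = len(xs)
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs


# Toy data: one swapped pair (3 and 2) among four observations.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]

tau = kendall_tau(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, giving τ = (5 - 1) / 6.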

Missing values

Sample

First rows

Index; Neighbourhood; 1|Neighbourhood Number; 3|Population, 2016; 9|Land area in square kilometres; 10|Children (0-14 years); 11|Youth (15-24 years); 12|Working Age (25-54 years); 13|Pre-retirement (55-64 years); 14|Seniors (65+ years); 15|Older Seniors (85+ years); 1055|Total - Household after-tax income groups in 2015 for private households - 100% data; 1071| $100,000 and over; 1973|Total - Commuting duration for the employed labour force aged 15 years and over in private households with a usual place of work or no fixed workplace address - 25% sample data; 1974| Less than 15 minutes; 1975| 15 to 29 minutes; 1976| 30 to 44 minutes; 1977| 45 to 59 minutes; 1978| 60 minutes and over
0; West Humber-Clairville; 1; 33312; 29.81; 5060; 5445; 13845; 3990; 4980; 615; 10290; 2360; 15570; 2325; 4650; 3955; 1615; 3010
1; Mount Olive-Silverstone-Jamestown; 2; 32954; 4.52; 7090; 5240; 13615; 3475; 3560; 300; 9875; 1205; 12595; 1425; 3825; 3265; 1345; 2730
2; Thistletown-Beaumond Heights; 3; 10360; 3.31; 1730; 1410; 4160; 1195; 1880; 350; 3280; 725; 4345; 655; 1315; 1215; 485; 685
3; Rexdale-Kipling; 4; 10529; 2.49; 1640; 1355; 4300; 1520; 1730; 300; 3845; 750; 4735; 680; 1500; 1115; 560; 875
4; Elms-Old Rexdale; 5; 9456; 2.86; 1805; 1440; 3700; 1255; 1275; 145; 3215; 515; 4040; 365; 1325; 965; 540; 845
5; Kingsview Village-The Westway; 6; 22000; 5.05; 4240; 3020; 8635; 2550; 3585; 575; 7775; 1405; 8825; 1225; 2815; 2155; 970; 1675
6; Willowridge-Martingrove-Richview; 7; 22156; 5.53; 3555; 2625; 8140; 2905; 4905; 885; 8510; 2245; 9365; 1155; 3165; 2320; 1170; 1545
7; Humber Heights-Westmount; 8; 10948; 2.75; 1450; 1140; 3790; 1510; 3045; 950; 4135; 1000; 4350; 360; 1405; 1260; 595; 740
8; Edenbridge-Humber Valley; 9; 15535; 5.47; 2120; 1805; 5940; 2385; 3290; 665; 6260; 2120; 7095; 810; 2000; 2070; 1195; 1030
9; Princess-Rosethorn; 10; 11051; 5.17; 1770; 1580; 3825; 1855; 2025; 325; 3865; 2165; 4805; 585; 1560; 1215; 875; 565

Last rows

Index; Neighbourhood; 1|Neighbourhood Number; 3|Population, 2016; 9|Land area in square kilometres; 10|Children (0-14 years); 11|Youth (15-24 years); 12|Working Age (25-54 years); 13|Pre-retirement (55-64 years); 14|Seniors (65+ years); 15|Older Seniors (85+ years); 1055|Total - Household after-tax income groups in 2015 for private households - 100% data; 1071| $100,000 and over; 1973|Total - Commuting duration for the employed labour force aged 15 years and over in private households with a usual place of work or no fixed workplace address - 25% sample data; 1974| Less than 15 minutes; 1975| 15 to 29 minutes; 1976| 30 to 44 minutes; 1977| 45 to 59 minutes; 1978| 60 minutes and over
130; Rouge; 131; 46496; 36.89; 7960; 6700; 18510; 6690; 6625; 685; 13395; 4830; 21500; 2355; 5120; 5145; 3335; 5530
131; Malvern; 132; 43794; 8.85; 7910; 6620; 17865; 5535; 5890; 445; 13430; 2425; 19225; 2520; 4485; 4555; 2360; 5300
132; Centennial Scarborough; 133; 13362; 5.39; 2150; 1850; 5030; 1955; 2385; 160; 4375; 2090; 6170; 670; 1465; 1410; 925; 1705
133; Highland Creek; 134; 12494; 5.20; 1545; 1925; 4620; 2020; 2395; 190; 3700; 1585; 5720; 625; 1460; 1450; 675; 1510
134; Morningside; 135; 17455; 5.74; 2880; 2690; 6840; 2355; 2695; 355; 5885; 1130; 7450; 765; 1750; 1840; 1100; 2010
135; West Hill; 136; 27392; 9.59; 4635; 3950; 10765; 3785; 4240; 625; 9985; 1745; 11050; 1015; 2455; 2575; 1740; 3275
136; Woburn; 137; 53485; 12.31; 9625; 7660; 21945; 6245; 8010; 1130; 18430; 2865; 21595; 2740; 5360; 5280; 2895; 5320
137; Eglinton East; 138; 22776; 3.23; 4180; 3130; 9180; 2825; 3505; 560; 7905; 995; 9275; 940; 1840; 2300; 1760; 2435
138; Scarborough Village; 139; 16724; 3.10; 3365; 2360; 6685; 2095; 2225; 430; 5930; 790; 6245; 610; 1415; 1430; 1025; 1775
139; Guildwood; 140; 9917; 3.71; 1295; 1065; 3370; 1555; 2635; 540; 3995; 1320; 4165; 415; 865; 1095; 675; 1115